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Introduction: 

Mounting  evidence  suggests  that  androgen  receptor  (AR)  signaling  is  critical  in 
all  phases  of  prostate  cancer  (PCa)  development,  including  disease  initiation,  androgen- 
dependent  tumor  growth  and  late  stage  castrate-resistant  disease.  Resistance  to 
androgen  ablation  therapies  is  not  accompanied  by  a  loss  of  AR,  but  rather  associated 
with  reemergence  of  androgen  signaling  and  re-expression  of  AR-regulated  genes 
(Balk,  2002;  Buchanan  et  al,  2001;  Debes  &  Tindall,  2004;  Feldman  &  Feldman,  2001). 
Although  a  recent  study  showed  that  the  general  androgen  signaling  signature  was 
attenuated  in  ablation-resistant  metastatic  PCa,  the  AR  target  genes  with  strong 
androgen  promoter-enhancer  elements  remained  active  (Tomlins  et  al,  2007).  Ablation- 
resistant  PCa  clones  were  selected  and  required  minimal  androgen  signaling  for 
continued  survival  in  the  castrated  state.  Three  studies  exemplify  the  AR-dependence  of 
ablation-resistant  PCa  cells.  First,  disruption  of  AR  expression  by  a  specific  antibody  or 
ribozyme  inhibited  proliferation  of  ablation-resistant  PCa  cells  in  the  absence  of 
androgens  (Zegarra-Moro  et  al,  2002).  Second,  increased  AR  expression  was 
necessary  and  sufficient  to  convert  androgen-sensitive  PCa  to  an  ablation-resistant 
state  (Chen  et  al,  2004).  Third,  specific  expression  in  mouse  prostate  epithelial  cells  of 
an  AR  transgene,  which  contained  a  gain-of-function  mutation  (with  increased  basal 
activity  and  response  to  coregulators)  resulted  in  PCa  development  in  100%  of  the 
animals  (Han  et  al,  2005).  These  lines  of  evidence  prove  that  aberrant  AR  signaling  is 
sufficient  to  cause  PCa  and  that  under  certain  conditions  the  AR  may  act  as  an 
oncogene.  Studies  funded  by  this  idea  development  grant  are  aimed  to  better 
understand  the  underlying  mechanisms  that  govern  this  activity  of  the  AR  along  with  its 
target  genes. 

Body: 


Based  on  the  reviewer’s  feedback  from  a  previous  progress  report  three  years 
ago  (2004-2005),  we  have  reformulated  our  specific  aims  to  accommodate  findings 
made  during  the  1st  year  of  funding.  The  new  SOW,  submitted  in  April  2006  and 
subsequently  accepted,  is  as  listed  below: 
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Revised  Statement  of  Work  for  W81XWH-04-1-0823  (April  2006) 

Two  specific  aims  and  5  tasks  were  formulated: 

Specific  Aim  #1:  To  identify  non-steroidal  pathways  modulating  AR  activity. 

Specific  Aim  #2:  To  elucidate  how  non-steroidal  signaling  determines 
transcription  complex  compositions  and  histone  code  modifications  in  regulatory 
areas  of  AR  target  genes. 


Task  1:  To  analyze  AR  target  gene  chromatin  architecture  in  the  LNCaP/C4- 
2B  cell  culture  model  of  progression  to  androgen-independence  of 
prostate  cancer  cells  (months  1-12). 

a.  Compare  chromatin  integrated  gene  expression  with  transiently 
transfected  luciferase  reporters  (months  1-3). 

b.  Measure  histone  modifications  (acetylation  &  methylation)  across  the 
entire  PSA  locus  (months  4-6). 

c.  Check  gene  expression  of  PSA  and  histone  modifications  at  the  PSA  locus 
after  siRNA  knock-down  of  the  AR  (month  7-10). 

Task  2:  Develop  ChIP  and  Q-RT  PCR  assays  for  other  PSA  target  genes 
(months  10-15). 

Task  3:  Perform  similar  analyses  as  outlines  in  task  1  on  the  other  AR 
target  genes  (months  15-18). 

Task  4:  Perform  similar  analyses  as  outlines  in  task  1  using  another  cell- 
culture-independent  model  of  progression  to  androgen- 
independence,  namely  the  xenograft  CWR22  mouse  model  from 
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which  we  already  have  frozen  tissue  obtained  by  funding  from 
another  DoD  grant,  W81XWH-04-1-0049  (months  15-18). 

Task  5:  Write  manuscripts  (months  12-18). 

We  have  essentially  completed  both  aims  (also  see  progress  reports  2005,  2006  and 
2007  for  detail  of  the  older  work),  which  led  to  9  papers  (Jariwala  et  al,  2007;  Jia  et  al, 
2008;  Jia  et  al,  2004;  Jia  &  Coetzee,  2005;  Jia  et  al,  2006;  Kim  &  Coetzee,  2004;  Kim  et 
al,  2005;  Prescott  &  Coetzee,  2006;  Prescott  et  al,  2007). 


Task  2-5:  Determination  and  Assessment  of  novel  AR  target  genes 

We  have  completed  the  tasks  listed  above  (see  also  progress  reports  2005- 
2007).  Because  we  did  not  identify  many  AR  target  genes  and  came  to  the  end  of  the 
grant  (time  and  funds),  task  4  was  not  attempted. 

Here  I  will  summarize  data  obtained  from  a  chromosome-wide  scan  of  ARORs 
and  histone  H3  acetylation  across  chromosomes  18  &  20,  which  represents  an 
extension  of  task  1 .  The  published  paper  are  attached  as  an  appendix. 

Background:  The  androgen  receptor  (AR)  is  a  steroid-activated  transcription  factor  that 
binds  at  specific  DNA  locations  and  plays  a  key  role  in  the  etiology  of  prostate  cancer. 
While  numerous  studies  have  identified  a  clear  connection  between  AR  binding  and 
expression  of  target  genes  for  a  limited  number  of  loci,  high-throughput  elucidation  of 
these  sites  allows  for  a  deeper  understanding  of  the  complexities  of  this  process. 

Methodology/Principle  Findings:  We  have  mapped  189  AR  occupied  regions 
(ARORs)  and  1,388  histone  H3  acetylation  (AcH3)  loci  to  a  3%  continuous  stretch  of 
human  genomic  DNA  using  chromatin  immunoprecipitation  (ChIP)  microarray  analysis. 
Of  62  highly  reproducible  ARORs,  32  (52%)  were  also  marked  by  AcH3.  While  the 

Page  6 


COETZEE,  Gerhard  A.  Final  Report  W81XWH-04- 1-0823 

number  of  ARORs  detected  in  prostate  cancer  cells  exceeded  the  number  of  nearby 
DHT-responsive  genes,  the  AcH3  mark  defined  a  subclass  of  ARORs  much  more  highly 
associated  with  such  genes  -  12%  of  the  genes  flanking  AcH3+  ARORs  were  DHT- 
responsive,  compared  to  only  1%  of  genes  flanking  AcH3-  ARORs.  Most  ARORs 
contained  enhancer  activities  as  detected  in  luciferase  reporter  assays.  Analysis  of  the 
AROR  sequences,  followed  by  site-directed  ChIP,  identified  binding  sites  for  AR 
transcriptional  coregulators  FoxAl,  CEBP|3,  NFI  and  GATA2,  which  had  diverse  effects 
on  endogenous  AR  target  gene  expression  levels  in  siRNA  knockout  experiments. 

Conclusions/Significance:  We  suggest  that  only  some  ARORs  function  under  the 
given  physiological  conditions,  utilizing  diverse  mechanisms.  This  diversity  points  to 
differential  regulation  of  gene  expression  by  the  same  transcription  factor  related  to  the 
chromatin  structure. 

Key  Research  Accomplishments  during  the  entire  grant  period: 

1.  Increased  AR  signaling  is  a  consequence  of  chromatin  modifications  at 
particular  loci,  contributing  to  the  transition  of  PCa  cells  to  ablation  resistance. 

2.  The  chromatin-modified  state  referred  to  above  depends  on  sustained  AR 
activity. 

3.  ARORs  marked  by  histone  H3  acetylation  are  10  times  more  likely  to  be 
involved  in  gene  regulation  in  prostate  cancer  cells. 

4.  Most  ARORs  in  prostate  cancer  cells  are  either  not  active  or  poised  to 
activate  gene  expression  as  conditions  may  change. 


Reportable  Outcomes: 

See  publication  list  below. 
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Conclusions: 

Over  the  past  several  decades  a  general  concept  of  TF-mediated  gene 
regulation  has  emerged.  It  depicts  a  sequence  of  events,  starting  with  TFs  binding  at 
specific  regions  of  DNA,  followed  by  the  covalent  modification  of  chromatin  proteins 
(primarily  histones)  to  ‘open’  chromatin,  and  ultimately  allowing  the  nearby  docking  of 
the  transcription  machinery.  This  is  not  necessarily  always  the  case  and  alternative 
insights  were  recently  revealed  by  the  use  of  new  high  throughput  techniques,  which 
allow  the  agnostic  mapping  of  such  regions  and  an  unbiased  appreciation  of  their 
functions.  In  the  current  study,  many  AR  binding  sites  were  not  associated  with  nearby 
gene  expression  despite  the  fact  that  they  had  intrinsic  positive  regulatory  activity  in 
chromatin-independent  assays.  On  the  other  hand,  those  that  regulated  the  expression 
of  nearby  genes  were  10-times  more  likely  marked  by  acetylation,  indicating  that  AR 
occupancy  and  associated  histone  acetylation  are  both  necessary  requirements  for 
positive  regulatory  competence.  The  existence  of  silent  AR  occupied  regions,  however 
indicates  that  AR  occupancy  does  not  necessarily  lead  to  the  recruitment  of  histone 
acetylases  and  subsequent  nearby  gene  expression.  Therefore,  the  rate-limiting  step  for 
AR-mediated  gene  expression  is  not  occupancy  per  se.  Silent  occupied  regions  may  act 
as  a  reservoir  of  bound  AR,  possibly  poised  to  mediate  gene  expression  as  conditions 
change.  Such  conditions  may  include  chromatin  modifications  and/or  the  recruitment  of 
coregulators,  since  most  of  the  AR  occupied  regions  were  also  co-occupied  by  other 
TFs,  which  in  several  cases  acted  as  coregulators  of  gene  expression.  We  conclude 
that  the  diversity  of  site-specific  functions  of  the  AR  points  to  differential  regulation  of 
gene  expression  by  the  same  transcription  factor  related  to  chromatin  structure  and  the 
presence  of  coregulators. 

Appendices: 

(Jia  et  al,  2008) 
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Abstract 

Background :The  androgen  receptor  (AR)  is  a  steroid-activated  transcription  factor  that  binds  at  specific  DNA  locations  and 
plays  a  key  role  in  the  etiology  of  prostate  cancer.  While  numerous  studies  have  identified  a  clear  connection  between  AR 
binding  and  expression  of  target  genes  for  a  limited  number  of  loci,  high-throughput  elucidation  of  these  sites  allows  for  a 
deeper  understanding  of  the  complexities  of  this  process. 

Methodology/Principal  Findings:  We  have  mapped  189  AR  occupied  regions  (ARORs)  and  1,388  histone  H3  acetylation 
(AcH3)  loci  to  a  3%  continuous  stretch  of  human  genomic  DNA  using  chromatin  immunoprecipitation  (ChIP)  microarray 
analysis.  Of  62  highly  reproducible  ARORs,  32  (52%)  were  also  marked  by  AcH3.  While  the  number  of  ARORs  detected  in 
prostate  cancer  cells  exceeded  the  number  of  nearby  DHT-responsive  genes,  the  AcH3  mark  defined  a  subclass  of  ARORs 
much  more  highly  associated  with  such  genes  -  12%  of  the  genes  flanking  AcH3+ARORs  were  DHT-responsive,  compared 
to  only  1%  of  genes  flanking  AcH3-ARORs.  Most  ARORs  contained  enhancer  activities  as  detected  in  luciferase  reporter 
assays.  Analysis  of  the  AROR  sequences,  followed  by  site-directed  ChIP,  identified  binding  sites  for  AR  transcriptional 
coregulators  FoxAl,  CEBP(3,  NFI  and  GATA2,  which  had  diverse  effects  on  endogenous  AR  target  gene  expression  levels  in 
siRNA  knockout  experiments. 

Conclusions/Significance:  We  suggest  that  only  some  ARORs  function  under  the  given  physiological  conditions,  utilizing 
diverse  mechanisms.  This  diversity  points  to  differential  regulation  of  gene  expression  by  the  same  transcription  factor 
related  to  the  chromatin  structure. 
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Introduction 

The  ‘textbook’  paradigm  of  gene  regulation  by  steroid  hormone 
receptors  entails  the  binding  of  receptors  to  hormone  response 
elements  located  5 '-upstream  of  the  transcription  start  sites  (TSSs) 
of  responsive  genes,  followed  by  the  recruitment  of  non-DNA- 
binding  coactivators  or  corepressors.  These  latter  factors  modify 
histones  and  interact  with  the  basal  transcriptional  machinery  to 
modulate  transcriptional  initiation  [1],  This  paradigm  is  based  on, 


and  has  been  the  basis  for,  many  studies  in  which  steroid  hormone 
receptor  binding  motifs,  5 '-upstream  of  TSSs  of  target  genes,  were 
identified  and  characterized.  Thus,  DNA  binding  locations  for 
steroid  receptors  were  found  where  investigators  looked  for  them. 
More  recent  data  however  have  revealed  that  the  distribution  of 
steroid  receptor  occupancy  genome-wide  has  no  preference  for  5'- 
flanking  sequences  of  annotated  genes  [2,3].  Moreover,  the 
distribution  of  such  sites,  including  androgen  receptor  (AR)- 
occupied  regions  (ARORs)  on  chromosomes  21  &  22  [3],  is  poorly 
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correlated  with  gene  density.  For  this  reason,  two  studies  that  used 
genomic  windows  around  the  TSSs  of  annotated  genes  may  have 
been  limited  in  their  capacity  to  assign  ARORs  in  a  genome-wide 
fashion  [4,5]  by  missing  functional  ARORs  far  away  on  linear 
DNA.  Nonetheless,  it  is  not  a  trivial  task  to  assign  a  priori 
functionality  to  ARORs  if  they  are  not  directly  associated  with 
promoters  of  nearby  genes.  One  potential  way  to  better  assign 
functionality  to  transcription  factor  binding  sites,  including 
ARORs,  is  to  make  use  of  chromatin  analyses.  In  particular, 
ARORs  containing  active  histone  modifications  may  function  as 
enhancers,  modulating  transcription  at  a  distance. 

Histone  covalent  modifications,  also  referred  to  as  chromatin 
epigenetics,  are  involved  in  normal  somatic  cell  development  as 
well  as  in  the  progression  of  diseases,  such  as  cancer  [6,7].  Such 
histone  modifications  or  ‘marks’  lead  to  the  recruitment  and 
subsequent  docking  of  regulatory  protein  complexes  that  modulate 
transcription.  These  marks  provide  a  nuanced  chromatin 
‘language’  that  demarcate  chromatin  into  structural  domains  with 
dynamic  functional  consequences  [8] .  Systematic  studies  of 
chromatin  modifications  have  revealed  a  complex  landscape 
including  ‘punctate’  sites  of  modified  histones  (histone  H3  lysine 
9/14  acetylation  and  lysine  4  di-  and  trimethylation)  at 
transcription  start  sites  and  distal  regulatory  elements  [9,10]. 
Histone  H3  acetylation  is  an  epigenetic  mark  that  allows 
prediction  of  functional  regulatory  elements  [11],  The  molecular 
mechanisms  of  how  chromatin  remodeling  modulates  transcrip¬ 
tion  in  the  context  of  many  diseases,  including  prostate  cancer 
(PCa),  will  undoubtedly  have  a  major  impact  in  disease 
understanding  and  management.  In  the  case  of  PCa,  it  is  known 
that  the  progression  to  the  fatal  stage  of  the  disease,  ablation 
resistance,  depends  to  a  large  degree  on  the  activity  of  the  AR  [12- 
16],  a  potent  transcription  factor  (TF)  that  facilitates  epigenetic 
control  of  gene  expression  at  many  target  loci  across  the  entire 
human  genome. 

With  the  development  of  high  throughput  methodologies  such 
as  ChIP-chip  (reviewed  in  [17])  and  more  recently  GhIP-seq  (e.g. 
[18,1 9]),  it  has  become  possible  to  comprehensively  map  regions  in 
the  genome  of  mammalian  cells  that  are  occupied  by  TFs  of 
interest.  Often,  such  studies  identify  many  regions  that  are 
occupied  without  any  apparent  functional  consequence  [20,2 1] . 
To  determine  genomic  AR  binding  sites  and  better  understand 
their  biological  significance,  we  mapped  both  AR-occupied 
regions  (ARORs)  as  well  as  histone  H3  acetylation  (AcH3)  states 
chromosome-wide  (large  parts  of  chromosomes  19  and  20)  in  the 
C4-2B  aggressive  PCa  cell  line.  We  chose  the  AcH3  mark,  as  it  is  a 
well-characterized  marker  of  active  enhancers  and  promoters 
[10,22].  Of  the  ARORs  identified,  only  a  subset  has  clear 
functional  consequences  in  the  PCa  cell  line  analyzed,  indicating 
the  existence  of  diverse  AR  functions  under  changing  physiological 
circumstances. 

Results 

AR-occupied  regions  in  prostate  cancer  cells  exceed 
number  of  DHT-responsive  genes 

We  performed  three  independent  AR  ChIP-chip  analyses  using 
a  genomic  tiling  microarray  (NimbleGen  Systems  Inc.,  Madison, 
WI)  that  covers  sequences  on  chromosomes  19  and  20  that  total 
about  3%  of  the  human  genome,  including  the  classical  AR  target 
gene  PSA/KLK3.  We  implemented  a  novel  peak-calling  algorithm 
diat  allowed  us  to  identify  189  genomic  regions  (about  500  bp  on 
average)  occupied  by  the  AR  in  at  least  two  of  the  three 
experiments,  with  62  [level  1  (L 1 )  ARORs]  being  reproduced  in  all 
three,  and  127  [level  2  (L2)  ARORs]  being  reproduced  in  exacdy 


two  of  the  three  (Figure  1A).  Their  coordinates  are  provided  in 
Table  SI,  where  they  are  numbered  consecutively  from  A001  to 
A189.  The  AROR  peaks  were  generally  robust  in  multiple 
sampling  experiments  (see  Materials  and  Methods).  Frequently, 
ARORs  were  classified  as  L2  and  not  LI  because  of  missing 
probes  imposed  by  repetitive  sequences  and  requirements  of  the 
NimbleGen  design  criteria.  For  instance,  AROR  A042  encom¬ 
passes  the  well-characterized  PSA/KLK3  enhancer  [23-25],  but 
was  called  as  an  L2  AROR  primarily  because  it  partly  overlaps  a 
repeat-masked  region  (Figure  IB).  A043  identifies  a  novel  AROR 
at  the  KLK  locus  (Figure  IB),  and  four  other  examples,  ARORs 
A033,  A067,  A128  and  A129,  are  displayed  in  Figure  1C.  The 
AROR  calling  in  our  ChIP-chip  analysis  has  a  very  low  false 
positive  discovery  rate,  which  is  reflected  by  the  fact  that  21/21  LI 
ARORs  (Figure  IE)  and  6/7  L2  ARORs  (Figure  IF)  could  be 
validated  by  site-specific  independent  ChIP  assays.  For  the  four 
negative  control  (NC)  regions,  two  of  which  (NCI  and  NC2) 
contain  canonical  androgen  response  elements  (ARE),  no  AR 
occupancy  was  detected  (Figure  IF).  Importantly,  with  the 
exceptions  of  A039  and  A129  (Figure  IE),  we  found  little  evidence 
of  ligand-independent  AR  occupancy,  which  is  in  line  with 
previous  observations  at  the  canonical  AR  responsive  enhancer  for 
the  PSA  gene  [24,26].  These  exceptions,  however,  may  have 
important  implications  for  ligand-independent  signaling  of  the  AR 
as  part  of  the  ablation-resistant  phenotype  in  advanced  prostate 
tumors. 

Two  replicates  for  the  AcH3  ChIP-chip  experiment  were 
sufficient  because  they  were  highly  reproducible  -  of  1,388  regions 
identified  within  the  two  replicates,  1,189  (>90%)  were  in 
common  (Figure  1A).  This  high  reproducibility  is  attributable  to 
the  stability  of  this  lineage-specific  histone  modification.  Mapping 
of  the  AcH3  peaks  to  the  human  genome  show  that  48%  of  them 
overlap  with  TSSs  (Figure  ID,  insert).  In  contrast,  very  few 
ARORs  coincide  with  TSSs,  indicating  that  AR  rarely  occupies 
proximal  promoters.  In  fact  our  ARORs,  like  the  ER-occupied 
regions  (ERORs)  observed  in  MCF7  breast  cancer  cells  [2],  were 
distributed  randomly  with  respect  to  distance  from  TSSs 
(Figure  ID)  and  were  thus  mainly  located  in  introns  or  intergenic 
sequences  (Figure  ID,  insert).  This  distribution  was  veiy  similar  to 
that  obtained  for  ARORs  in  chromosomes  21  &  22  of  LNCaP  cells 
(Figure  SI)  [3].  In  terms  of  evolutionary  conservation,  both  the 
ARORs  and  ERORs  were  only  slightly  more  conserved  across 
evolution  than  a  set  of  random  control  sequences  that  exclude 
repetitive  sequences  not  represented  on  the  tiling  array  (Figure  S2). 

Extrapolation  of  our  chromosome  19/20  results  suggests  that 
under  similar  experimental  and  computational  stringency  we 
would  expect  in  the  human  genome  just  over  2,000  ARORs  that 
would  be  classified  as  LI  and  around  6,300  as  L2.  Interestingly, 
these  estimates  well  exceeded  the  number  of  DHT-responsive 
genes  in  C4-2B  cells  as  assessed  using  genome-wide  gene  Illumina 
expression  arrays  (primary  data  presented  in  Table  S7).  Even 
when  responsiveness  was  conservatively  defined  using  a  t-test 
statistical  Expectation  Value  (E-value)  cutoff  of  150  (permutation- 
adjusted  p-value  =  0.003),  only  552  of  46,713  transcripts  (1.2%) 
were  identified  as  stimulated  and  416  (0.9%)  repressed  by  DHT  at 
16  hours  compared  to  vehicle  control  (Figure  S3  and  Table  S2). 
When  we  confined  the  analysis  to  the  1,232  RefSeq  transcripts 
corresponding  to  the  ChIP-chip  analyzed  areas  of  chromosomes 
19/20,  only  24  (1.9%)  were  stimulated  and  19  (1.5%)  repressed  by 
DHT  at  the  E<5  (permutation-adjusted  p-value  =  0.004)  level 
(Figure  2).  Yet,  of  the  178  transcripts  containing  or  adjacent  to  an 
LI  or  L2  AROR,  only  16  (9.0%)  were  regulated  by  DHT  (12 
stimulated,  4  repressed).  Quantitative  RT-PCR  time-course 
analyses  of  four  of  these  genes  ( KLK3 ,  PYGB,  TGM2  and  SERINC3) 
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Figure  1.  Characterization  of  ARORs  and  AcH3  regions  on  chromosome  19  and  20.  Three  replicate  ChIP-chip  experiments  identified  738 
Androgen  Receptor  Occupied  Regions  (ARORs),  62  of  which  were  common  to  all  three  replicates  (LI  ARORs)  and  1 27  common  to  only  two  of  three 
(L2  ARORs),  while  two  replicate  ChIP-chip  experiments  identified  1,388  regions  with  acetylated  histone  H3  marks,  1,189  of  which  were  present  in  both 
replicates  (A).  Genome  plots  are  shown  for  the  kallikrein  locus  (B)  and  three  other  AROR-containing  loci  (C),  where  AR-ChIP  peaks  are  labeled,  and  raw 
log2  ratios  [from  0  (1-fold)  to  2  (4-fold)]  for  each  replicate  are  shown  in  green  (AR-ChIP)  and  blue  (AcH3-ChlP).  Panel  (D)  shows  the  genomic 
positioning  of  the  189  L1/L2  AROR  peaks  and  1,189  AcH3  peaks.  A  cumulative  distribution  plot  (outer)  shows  that  the  distance  from  annotated 
transcription  start  sites  (TSSs)  is  similar  between  ARORs,  Estrogen  Receptor  Occupied  Regions  (ERORs)  from  [2],  and  randomly  selected  regions  from 
the  repeat-masked  tiling  array,  while  a  majority  of  AcH3  peaks  are  located  at  or  near  TSSs.  All  three  classes  (AROR,  EROR,  and  AcH3)  are  excluded  from 
exons  relative  to  randomly  selected  regions  (insert).  The  selected  ARORs  were  validated  by  independent  ChIP-qPCR  (E-G).  C4-2B  cell  were  incubated 
in  phenol  red-free  RPMI  1640  containing  5%  CSS  for  3  days  and  then  treated  with  10  nM  DHT  or  ethanol  (EtOH)  vehicle  for  4  h.  Conventional  site- 
specific  ChIP  assays  were  performed  with  anti-AR  antibody.  Normal  IgG  was  used  in  parallel.  Twenty-one  LI  ARORs  (E),  7  L2  ARORs  (F)  and  4  negative 
control  (NC)  (see  table  S4)  (G)  regions  were  examined  by  TaqMan  qPCR.  Acetylated  ARORs  are  indicated  by  asterisks.  A042  (F)  is  the  PSA  enhancer  and 
acted  as  a  positive  control.  All  values  are  presented  as  percentage  of  input. 
doi:1 0.1 371/journal.pone.0003645.g001 


(Figure  2  insert)  verified  both  the  Ulumina  expression  data  and  the 
choice  of  the  16-hour  time  point  after  DHT  treatment.  As  such,  it 
appears  that  the  majority  of  ARORs  have  no  nearby  genes 
responsive  to  DHT  in  these  prostate  cancer  cells,  begging  the 
question  of  their  functionality. 

Histone  H3  Acetylation  defines  a  subclass  of  ARORs  more 
highly  associated  with  DHT-stimulated  genes 

AR  occupancy  at  many  regions  identified  in  our  study  does  not 
lead  to  alterations  in  expression  of  nearby  genes  as  a  function  of 
DHT  exposure.  Since  AcH3  is  a  landmark  of  functional  enhancers 
[10],  we  examined  the  AcH3  state  of  the  ARORs  to  determine 
their  possible  functionality.  Critically,  52%  of  LI  ARORs,  27%  of 
L2  ARORs  and  12%  of  L3  ARORs  (ARORs  detected  in  only  1  of 
3  ChIP-chip  experiments)  overlapped  with  AcH3  peaks,  which  is 
progressively  in  excess  of  the  6%  overlap  predicted  for  random 
association  (Figure  3A). 


To  determine  whether  AROR  acetylation  has  any  power  to 
predict  DHT  responsiveness  of  nearby  genes,  we  compared  the 
frequency  of  stimulated,  repressed,  and  unaffected  genes  at  loci 
that  contain  an  acetylated  AROR(s),  non-acetylated  AROR(s),  or 
no  AROR  at  all.  The  results  (Figure  3B)  show  that  AcH3+ARORs 
are  relatively  enriched  for  DHT-stimulated  transcripts.  Specifical¬ 
ly,  12.5%  of  the  genes  flanking  AcH3+ARORs  were  DHT- 
stimulated  (note  that  the  expected  maximum  is  less  than  100%,  as 
we  don’t  necessarily  expect  an  intergenic  AROR,  even  if 
functional,  to  target  transcripts  both  upstream  and  downstream 
from  the  AROR).  This  is  highly  enriched  (p  =  1.4x  10  7  by  Fisher 
exact  test)  relative  to  AcH3-  ARORs  and  control  regions  without 
an  AROR,  for  both  of  which  only  1.1%  of  adjacent  genes  were 
DHT-stimulated.  There  was  no  statistically  significant  relationship 
between  repressed  genes  and  association  with  ARORs,  irrespective 
of  their  acetylation  status.  Conversely,  11  of  24  up-regulated  genes 
were  adjacent  to  an  acetylated  AROR,  while  only  1  of  24  was 
adjacent  to  an  un-acetylated  AROR  (Figure  2).  While  these  results 
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Figure  2.  DHT-responsive  genes  in  C4-2B  cells.  Illumina  expression  arrays  were  used  to  measure  expression  levels  of  46,713  transcripts  in  three 
replicates  before  and  three  replicates  after  DHT  exposure  in  C4-2B  cells,  including  the  1,232  RefSeq  transcripts  within  our  chromosome  19/20  genome 
tiling  arrays,  which  are  shown  here.  The  student's  t-test  was  used  to  determine  statistical  significance,  and  p-values  were  adjusted  based  on  random 
permutations  of  the  full  dataset.  This  volcano  plot  shows  the  E-value  (number  of  transcripts  at  the  given  p-value  expected  by  chance)  plotted  against 
the  mean  fold  change.  24  transcripts  up-regulated  at  the  E  =  5  level  (permutation-adjusted  p-value  =  0.003)  and  19  transcripts  down-regulated,  are 
shown  in  the  upper  two  quadrants.  Transcripts  are  color-coded  based  on  whether  they  are  adjacent  to  an  AROR,  and  the  up-regulated  transcripts 
show  an  elevated  number  adjacent  to  acetylated,  but  not  un-acetylated,  ARORs.  The  inset  shows  a  time  course  of  endogenous  gene  expression.  C4- 
2B  cells  were  cultured  in  hormone-depleted  medium  for  3  days  and  then  treated  with  DHT  (10  nM)  or  ethanol  vehicle  for  the  indicated  times. 
Expression  levels  of  4  representative  genes  were  measured  by  real-time  RT-PCR.  The  data  is  normalized  to  18S  expression  in  log  scale;  Values  are  fold 
changes  over  the  vehicle  control  at  each  time  point. 
doi:1 0.1 371/journal.pone.0003645.g002 
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Figure  3.  Histone  H3  acetylation  defines  a  distinct  subclass  of  ARORs.  Histone  H3  acetylation  peaks  overlap  52%  of  the  most  reproducible 
LI  ARORs,  but  only  27%  of  the  L2  ARORs,  12%  of  the  L3  ARORs,  and  6%  of  random  sequences  from  the  area  covered  by  the  chromosome  19/20  tiling 
array  (A).  Transcripts  adjacent  to  acetylated  ARORs  are  significantly  more  likely  to  be  up-regulated  in  C4-2B  cells,  with  12.5%  of  transcripts  showing 
up-regulation,  as  opposed  to  1.1%  of  those  adjacent  to  un-acetylated  ARORs,  and  1.1%  of  those  not  adjacent  to  any  AROR;  this  was  not  the  case  for 
repressed  genes  (B).  While  un-acetylated  L1/L2  ARORs  are  about  as  likely  as  randomized  controls  to  cluster  on  the  genome  as  shown  by  the 
cumulative  distribution  plot  of  inter-AROR  distances  (C),  acetylated  ARORs  show  significantly  more  genomic  clustering  (D). 
doi:1 0.1 371/journal.pone.0003645.g003 
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demonstrate  a  strong  correlation  between  the  acetylation  of  an 
AROR  and  DHT-mediated  stimulation  of  nearby  gene(s),  the 
majority  of  genes  located  in  close  proximity  to  ARORs,  even 
acetylated  ones,  were  unresponsive  (Figure  3B). 

We  additionally  found  that  acetylated  ARORs  (Figure  3D)  were 
more  likely  to  occur  in  clusters  than  were  non-acetylated  ARORs 
(Figure  3C)  (e.g.  PTGB  AROR  cluster  in  Figure  1A).  This  is 
consistent  with  a  transcriptional  role  for  all  the  acetylated  ARORs, 
since  enhancer  elements  with  established  functions  are  often 
clustered  to  assure  fidelity  of  transcriptional  enhancement  by 
providing  redundancy  or  synergy  among  individual  elements  [27], 


Most  ARORs  contain  potential  enhancer  activity 

We  suspected  that  in  many  of  the  ARORs  in  C4-2B  cells,  the 
AR  was  present  in  complexes  devoid  of  transcriptional  trans¬ 
activation  activity.  We  therefore  tested  the  DHT-mediated 
enhancer  activity  of  6 1  ARORs  in  transient  transfection  reporter 
assays.  These  61  ARORs,  encompassing  all  of  the  highly 
reproducible  (LI)  ARORs  except  A059  (which  is  in  the  middle 
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of  a  simple  tandem  repeat),  were  cloned  upstream  of  a  thymidine 
kinase  (TR)  minimal  promoter/luciferase  reporter  and  subjected  to 
luciferase  assay  in  the  presence  or  absence  of  DHT  exposure  as 
described  in  Materials  and  Methods.  As  shown  in  Figure  4A,  forty 
of  the  61  LI  ARORs  (66%)  displayed  DHT-dependent  enhancer 
activity  with  p<0.05  (single  tailed  t-test).  Of  these,  19  ARORs 
stimulated  activity  of  the  heterologous  TK  promoter  by  >  5-fold. 
Thus,  most  of  the  ARORs  have  intrinsic  enhancer  potential, 
although  in  most  cases  this  potential  is  either  not  materialized  or 
not  detectable  when  the  AROR  is  assessed  in  its  native  context 
based  on  responsiveness  of  adjacent  genes  in  C4-2B  cells  (Figure  2 
and  see  Discussion).  Even  in  the  luciferase  assays,  where  the 
ARORs  are  tested  in  a  relatively  promiscuous  environment,  we 
observed  significant  variability  in  enhancer  activity  (Figure  4A), 
which  was  not  attributable  to  the  level  of  occupancy  (Figure  IE). 

Combinatorial  regulation  is  a  fundamental  and  an  increasingly 
central  aspect  of  transcriptional  control  in  higher  eukaryotic  cells, 
and  we  therefore  hypothesized  that  (i)  the  ARORs  might  be 
enriched  for  binding  sites  for  both  AR  as  well  as  AR-coregulators, 
and  (ii)  the  presence  of  such  sites  modulated  responsiveness  to 
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Figure  4.  Most  ARORs  have  enhancer  potential  (transactivation  activity),  which  is  associated  with  several  sequence  motifs.  Transient 
transfection  luciferase  reporter  assays  were  carried  out  in  duplicate,  and  repeated  independently  at  least  3  times  for  61  of  62  LI  ARORs  before  and 
after  DHT  exposure,  and  the  results  are  displayed  in  a  volcano  plot  (4A)  which  shows  the  fold  change  between  DHT+  and  DHT-  luciferase  levels  and 
the  p-value  significance  level  by  the  Student's  t-test.  At  the  p<0.05  level,  19  ARORs  (31%)  were  strongly  induced,  while  21  ARORs  (34%)  were  weakly 
induced.  From  LI  ARORs,  52  sequence  motifs  from  Transfac  and  de  novo  motif  discovery  algorithms  were  found  to  be  significantly  enriched  (see  text). 
We  performed  chi-square  tests  to  determine  those  motifs  (B,  upper)  and  motif  pairs  (B,  lower)  significantly  enriched  in  the  18  strongly  induced  (red 
bars)  vs.  the  19  non-responsive  (blue  bars)  ARORs.  Four  motifs  were  identified  (C),  including  binding  sites  for  FoxAl,  the  AR,  along  with  two  de  novo 
motifs  with  similarity  to  FoxAl,  NFI,  and  Octl. 
doi:1 0.1 371  /journal. pone.0003645.g004 
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DHT  in  a  combinatorial  manner.  To  pursue  the  first  hypothesis,  we 
searched  for  TF  binding  motifs  enriched  in  the  highly  reproducible 
LI  ARORs  compared  to  6200  equivalent  random  size-matched 
sequences  from  the  interrogated  genomic  territory  on  chromo¬ 
somes  19  and  20.  These  analyses  excluded  the  un-cloned  AROR 
A059,  as  well  as  A002  that  contains  14  tandem  canonical  AREs 
and  would  skew  the  analysis.  The  set  of  60  LI  ARORs  were 
investigated  using  both  scanning  and  de  novo  motif-fmding 
approaches.  Scanning  approaches  using  the  Cis-Elements  Anno¬ 
tation  System  (CEAS)  website  [28]  to  map  potential  binding  sites 
of  the  550  vertebrate  TF  binding  motifs  represented  in  the 
TRANSFAC  database  [29],  identified  five  potential  TF  binding 
sites  highly  over-represented  within  the  ARORs.  These  were  ARE 
(3.6-fold  enrichment,  p  =  7.2x10  12),  FINF3oi/FoxAl-binding 
motif  (1.8-fold,  p  =  2.2  x  10-16),  NFI-binding  motif  (1.9-fold, 
p  =  2.7xl0-9),  GRE  (1.5-fold,  p  =  2.5 xlO-12),  and  C/EBP  (1.4 
fold,  p  =  4.3x10  13)  (Table  S3).  Nonetheless,  only  10  of  60  LI 
ARORs  (17%)  contained  AREs  that  could  be  considered  high 
stringency  (PWM  log-odds>8.5),  whereas  38  of  60  (63%) 
contained  AREs  with  low  stringency  (PWM  log-odds>5.0).  De 
novo  approaches  utilizing  BioProspector  [30],  MDscan  [31]  and 
Weeder  algorithms  [32]  yielded  a  total  of  525  recurring  motifs 
within  the  60  LI  ARORs.  To  collapse  the  many  similar  motifs  into 
similarity  groups,  we  employed  linear  hierarchical  clustering  based 
on  highly  overlapping  motifs  (see  Materials  and  Methods).  This 
analysis  resulted  in  45  motifs  (Table  S3),  of  which  three  (ARE, 
GRE  and  FoxAl)  had  been  found  by  CEAS  as  detailed  above. 

We  next  compared  the  distribution  of  the  50  motifs  from  both 
scanning  and  de  novo  motif-fmding  approaches,  as  well  as  two 
alternate  ARE  versions  from  the  ConSite  [33],  between  18 
ARORs  (A002  was  excluded  from  the  19  strongly  induced 
ARORs,  not  to  skew  the  data)  most  responsive  to  DHT  in 
luciferase  enhancer  assays  with  the  19  least  responsive  (Figure  4A). 
Four  motifs  were  significantly  enriched  (p<0.01,  t-test)  in  the 
strongly  responsive  ARORs  (Figure  4B):  ARE  (2.2-fold),  FoxAl 
(1.8-fold),  and  two  de  novo  motifs  with  consensus  sequences  of 
TTGCTT  (2.2-fold)  and  TTGGCAAATA  (3.2-fold)  (Figure  4B 
and  C,  Figure  S4).  The  TTGGCAAATA  motif  appears  to 
resemble  overlapping  binding  motifs  for  NFI,  FoxAl  and  Oct-1 
(Figure  4C). 

Pair-wise  analysis  of  the  50  motifs  and  the  two  alternate  AREs 
identified  only  four  random  pair  combinations  significantly 
enriched  in  the  19  most  responsive  ARORs  (p<0.0005) 
(Figure  4B).  Two  of  these  ‘response  motif-pairs’  consisted  of  an 
ARE  with  either  the  TTGCTT  (4.6-fold  enrichment)  or 
TTGGCAAATA  (4.6-fold)  de  novo  motifs.  The  remaining  two 
did  not  have  AREs,  but  instead  consisted  of  a  FoxAl-binding  site 
with  either  TTGCTT  (4.6-fold)  or  the  TTGGCAAATA  (11.3- 
fold)  motifs  (Figure  4C). 

Combinatorial  regulation  by  AR  coregulators 

The  identification  of  motifs  enriched  in  ARORs  (Table  S3),  and 
in  particular  in  responsive  ARORs  (Figure  4B  and  C),  prompted 
us  to  investigate  the  role  of  FoxAl,  C/EBP,  NFI  and  Oct-1  in 
modulating  AR  genomic  action,  as  well  as  the  role  of  GATA2, 
which  has  been  recently  implicated  in  AR  genomic  action  in 
LNCaP  cells  [3].  We  tested  whether  these  TFs  are  constitutively 
present  at  ARORs  in  C4-2B  cells,  and  whether  occupancy  is 
affected  by  DHT-driven  AR  binding  as  tested  in  Figure  IE.  ChIP 
assays  were  performed  with  the  respective  antibodies,  and 
occupancy  was  determined  at  the  18  LI  ARORs  tested  in 
Figure  IE,  as  well  as  the  A042  L2  AROR  from  the  known  AR 
target  PSA/KLK3  locus.  As  shown  in  Figure  5,  occupancy  at  the 
basal  state  and  in  response  to  DHT  varied  significantly  as  a 
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Figure  5.  Occupancies  of  AR-coregulators  on  selected  ARORs. 

C4-2B  cells  were  cultured  in  hormone-depleted  medium  for  3  days  and 
then  treated  with  10  nM  DHT  or  ethanol  vehicle  for  4  h.  Conventional 
site-specific  ChIP  assays  were  performed  with  indicated  antibodies. 
Nineteen  ARORs  and  2  negative  control  (NC)  regions  were  examined  by 
qPCR.  The  values  are  presented  as  percentage  of  input. 
doi:10.1371/journal.pone.0003645.g005 


function  of  both  the  AROR  and  the  transcription  factor  of 
interest.  For  example,  FoxAl  was  present  at  almost  every  AROR 
with  occupancy  in  most  cases  increasing  following  treatment  with 
DHT.  Notable  exceptions  included  the  lack  of  FoxAl  recruitment 
to  AROR  A002  (which  contains  14  AREs),  and  the  inability  of 
DHT  to  enhance  FoxAl  occupancy  of  AROR  A071.  NFI  and  C/ 
EBPpi  occupancy  also  varied  considerably  among  ARORs,  but 
was  less  responsive  to  DHT.  C/EBPa  in  contrast,  which  has  been 
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found  at  ER-occupied  regions  in  MGF7  breast  cancer  cells  [2], 
was  not  detected  at  any  of  our  ARORs  in  C4-2B  prostate  cancer 
cells  (data  not  shown).  GATA2  occupied  only  acetylated  ARORs 
near  three  DHT-responsive  genes;  PSA  (A042),  PTGB  (A129)  and 
TGM2  (A156),  and  Oct-1  interaction  was  weakly  observed  on 
several  ARORs  with  only  one  of  three  different  antibody 
preparations.  Interestingly,  we  could  not  always  predict  occupancy 
of  any  particular  AROR  by  FoxAl ,  C/EBP,  NFI  and  Oct- 1  by  the 
presence  or  absence  of  consensus  motifs.  This  finding  is  suggestive 
of  complex  protein-protein  interactions  in  the  recruitment  of  a 
given  TF  to  a  specific  AROR,  and  further  indicates  that  the  site- 
specific  arrangement  of  TFs  creates  a  diversity  of  site-specific  AR- 
mediated  transcriptional  responses.  For  example,  A129  exhibited 
relatively  high  NFI  and  GATA2  occupancy,  but  low  to 
intermediate  C/EBP(3  and  FoxAl  levels,  A107  had  relatively 
high  C/EBPP,  low  to  intermediate  NFI  and  FoxAl  and  negligible 
GATA2,  whereas  A140,  was  highly  occupied  by  FoxAl  after 
DFIT  treatment  but  had  low  NFI,  intermediate  C/EBPP  and 
extremely  low  GATA2  occupancy  levels. 

The  C4-2B  cell  line  was  originally  obtained  from  LNCaP  cells 
grown  as  xenografts  after  castration  [34],  They  are  therefore 
considered  ‘androgen-independent’  although  AR-mediated  gene 
expression  is  still  quite  responsive  to  androgen  treatments  [24],  We 
compared  C4-2B  with  LNCaP  cells  with  respect  to  AR  and  TF 
occupancies  (Figure  S6).  The  occupancy  levels  in  the  absence  and 


presence  of  DHT  were  similar  between  the  two  cell  lines, 
indicating  that  the  AR  and  other  TF  occupancies  are  not 
dramatically  affected  by  the  androgen-dependent  status  of  the 
cells. 

Using  siRNAs  to  knock  down  the  levels  of  endogenous  FoxAl, 
C/EBP,  NFI  and  GATA2,  we  investigated  the  role  of  each  of 
these  transcription  factors  in  DHT-stimulation  of  AR  target  genes. 
Remarkably,  DHT  responsiveness  varied  with  both  the  TF 
investigated  and  amongst  the  five  tested  AR  target  genes  [PSA/ 
KLK3  (near  A042),  KLK2  (near  A043),  TGM2  (near  A156), 
TMPRSS2  (chromosome  21)  and  FKBP51  (chromosome  6)] 
(Figure  6)  in  a  manner  not  attributable  to  differential  occupancy 
(compare  Figures  5  and  6).  For  example,  knockdown  of  FoxAl 
decreased  DHT-responsiveness  of  TMPRSS2  and  FKBP51 , 
enhanced  that  of  TGM2  and  did  not  affect  PSA  and  KLK2 
expression.  In  contrast,  knockdown  of  NFI  decreased  DHT- 
responsiveness  of  PSA,  KLK2  and  TMPRSS2,  increased  that  of 
TGM2,  and  did  not  affect  FKBP51.  Knockdown  of  GATA2 
sensitized  TGM2  and  FKBP51  to  DHT  induction.  Interestingly, 
knockdown  of  C/EBPP  strongly  increased  basal  expression  of  PSA 
and  KLK2,  which  may  reflect  modification  of  AR  action  on  these 
genes  in  the  absence  of  added  ligand  [24],  We  had  previously 
analyzed  occupancy  at  ARORs  close  to  PSA  (A042)  and  TGM 
(A156)  (Figure  5);  it  is  interesting  to  note  that  occupancy  by 
FoxAl,  NFI,  C/EBPP,  and  GATA2  were  all  present  at  the 
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Figure  6.  Expression  of  AR  target  genes  after  siRNA-mediated  knockdown  of  coregulators.  C4-2B  cells  were  transfected  with  siRNA 
targeting  AR,  FoxAl,  NFI,  CEBPp,  GATA2  or  non-specific  (NS)  regions.  Two  days  after  transfection,  cells  were  treated  with  10  nM  DHT  or  ethanol 
vehicle  for  18  h.  Expression  levels  of  five  AR  target  genes  were  examined  by  real-time  RT-PCR.  Relative  mRNA  level  was  normalized  by  GAPDH  mRNA. 
Efficiency  of  each  siRNA  knockdown  was  measured  by  immunoblot  with  indicated  antibodies  (upper  right). 
doi:1 0.1 371  /journal. pone.0003645.g006 
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ARORs  close  to  the  genes  (re-plotted  as  Figure  S7),  and  likely 
exerted  their  actions  via  these  sites.  The  diverse  effects  of  each  of 
the  coregulators  on  AR  function  may  therefore  reflect  a  complex 
regulatory  mechanism  suitable  for  modulation  of  DHT  respon¬ 
siveness  under  different  physiological  conditions. 

Discussion 

The  functional  consequences  of  AR  engagement  at  DNA  target 
sites,  which  normally  follow  androgen  binding  to  the  AR,  are  of 
major  importance  in  the  development  of  the  male  phenotype  and 
in  diseases  such  as  PCa.  The  prostate  is  a  well-established  AR 
target  tissue,  where  it  is  known  that  5a-reductase  converts 
circulating  testosterone  to  the  more  potent  androgen,  DHT. 
DHT  in  turn  binds  the  AR,  mediates  its  nuclear  translocation,  and 
promotes  prostate  differentiation  via  specific  gene  expression 
modulation.  With  respect  to  PCa,  increased  life-time  exposure  to 
enhanced  AR  activity  predisposes  men  to  the  disease  [13].  Even 
more  striking,  aberrant  AR  activity  seems  to  be  necessary  and 
sufficient  to  convert  androgen-dependent  prostate  tumors  to 
ablation  resistant  ones;  this  fact  has  made  the  AR  an  attractive 
target  for  PCa  therapy  [16,35,36].  Although  non-genomic  actions 
of  the  AR  have  been  proposed  [37],  its  major  activity  is  thought  to 
be  a  consequence  of  DNA  engagement  and  transcriptional 
regulation.  Due  to  progress  in  genomic  techniques  measuring 
engagement  of  TFs,  the  quantitative  and  qualitative  mapping  of 
AR/DNA  engagement  profiles  have  become  the  basis  of  a  limited 
number  of  studies  during  the  past  year  [3-5] ,  however,  the  control 
of  such  engagement  and  its  consequences  remain  relatively 
unexplored. 

In  this  report,  we  employed  ChIP-chip  to  map  the  locations  of 
both  the  AR  and  AcH3  onto  3%  of  the  genome  in  C4-2B  prostate 
cancer  cells.  AR  occupancy  exhibited  higher  inter-experimental 
variance  than  AcH3,  but  was  still  sufficiently  reproducible  to 
detect  62  regions  occupied  in  every  one  of  three  independent 
experiments.  Aside  from  their  exclusion  from  exonic  sequences, 
ARORs  were  distributed  randomly  with  respect  to  gene  locations. 
Of  the  62  highly  reproducible  ARORs,  32  were  also  marked  by 
AcH3.  The  significance  of  AROR  acetylation  was  apparent  when 
examining  gene  expression  patterns  of  neighboring  genes  -  1 2  %  of 
genes  adjacent  to  AcH3+ARORs  were  DHT-stimulated,  com¬ 
pared  to  only  1  %  of  genes  adjacent  to  AcH3-  ARORs  (conversely, 
46%  of  up-regulated  transcripts  were  adjacent  to  acetylated 
ARORs  whereas  only  4%  were  adjacent  to  an  un-acetylated 
AROR.)  While  the  number  of  differentially  regulated  genes  on 
chromosomes  19  and  20  is  small,  this  roughly  10-fold  enrichment 
in  functionality  of  ARORs  with  the  acetylation  mark  illustrates  the 
utility  of  combining  ChIP  studies  of  individual  transcription  factors 
with  epigenetic  markers  in  the  same  cells. 

We  detected  many  more  ARORs  than  DHT-responsive  genes, 
leaving  most  ARORs  without  any  detectable  function  in  C4-2B 
cells.  We  can  speculate  that  these  ARORs,  particularly  the 
acetylated  ones,  modulate  transcription  at  levels  undetectable  in 
our  assay,  or  target  microRNAs  or  other  un-annotated  transcripts. 
They  may  even  function  as  enhancers  of  genes  over  large  linear 
DNA  distances  or  on  other  chromosomes  [38].  Transcriptional 
enhancement  of  other  genes  could  have  been  offset  by  compen¬ 
satory  mechanisms  of  RNA  destabilization,  or  difficult  to  detect 
due  to  high  basal  steady  state  levels.  Finally,  it  is  plausible  that 
occupancy  at  some  ARORs  was  without  any  transcriptional 
function  under  the  experimental  conditions  employed.  Some  of 
these  ARORs,  in  particular  acetylated  ARORs,  could  be  poised 
for  transcriptional  engagement  upon  arrival  of  a  missing  signal. 
Others  may  have  functions  unrelated  to  transcription,  such  as 


machinery 

promoter  1  1  gene 


machinery 

promoter  gene 


Figure  7.  Schematic  model  of  AROR  functions.  Three  types  of  AR/ 
DNA  engagements  are  envisioned.  Inactive  ARORs  represent  AR 
occupying  sites  of  relatively  condensed  chromatin  along  with 
coregulators  W  &  X;  these  sites  may  simply  be  reservoir  of  AR  to  be 
used  during  dramatic  changes  in  physiological  conditions.  Poised 
ARORs  represent  AR  at  AcH3  modified  sites  ready  to  engage  the 
transcription  initiation  machinery,  but  held  in  check  by  coregulaters  W 
&  Z.  Engaged  ARORs  represent  AR  actively  mediating  transcriptional 
control  of  target  gene(s)  by  looping  across  varying  genomic  distances 
perhaps  assisted  by  coregulators  Y  &  Z. 
doi:10.1371/journal.pone.0003645.g007 

chromosome  structural  regulation,  replication  or  DNA  repair  [21]. 
Still  others,  especially  non-acetylated  ARORs,  may  have  no 
function  at  all.  Thus,  we  envision  many  (even  most)  ARORs  are 
inactive  in  relatively  condensed  chromatin  at  any  given  time  and 
situation  with  or  without  coregulators  (Figure  7,  W  &  X);  the  latter 
my  act  as  pioneers  to  facilitate  the  original  recruitment  of  AR  to 
the  site.  Some  ARORs  may  be  poised  to  act  because  they  contain 
modified  chromatin  (AcH3)  and  a  different  complement  of 
coregulators  (Figure  7,  W  &  Z).  Only  a  small  number  of  ARORs 
is  actually  engaged  in  transcriptional  regulation  possibly  again  due 
to  the  presence  of  another  set  of  coregulators  (Y  &  Z)  that  may 
convert  poised  ARORs  to  engaged  ones.  We  propose  that  engaged 
ARORs  will  differ  in  activity  depending  on  the  physiological  and 
temporal  states  of  the  cells.  W,  X,  Y  &  Z  in  figure  7  represent 
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many  combinations  of  coregulators  acting  in  a  site-specific  manner 
allowing  for  qualitative  adjustment  of  AR  signaling  under  different 
situations. 

Transcriptional  enhancer  activity  of  ARORs  in  a  standard 
luciferase  assay  of  a  minimal  TK  promoter  demonstrated  that 
66%  of  the  LI  ARORs,  including  many  without  apparent 
influence  on  endogenous  genes,  had  transcriptional  enhancement 
potential  in  in  vitro  assays.  Indeed,  even  ARORs  that  had  no 
enhancer  activity  in  C4-2B  cells  turned  out  to  be  active  in  other 
prostate  cancer  cell  lines  (data  not  shown).  As  stated  above, 
whether  an  AROR  is  engaged,  poised,  or  transcriptionally 
inactive,  even  in  a  non-chromosomal  context  after  transient 
transfection,  likely  depends  on  the  differential  occupancy  of  the 
AROR  by  trans-acting  factors,  which  modify  the  transcriptional 
activity  of  the  AR.  This  was  best  exemplified  by  the  siRNA 
knockdown  of  specific  AR  coregulators  that  had  differential  effects 
on  different  DHT-responsive  genes  (Figure  6).  These  AR 
coregulators,  including  FoxAl,  C/EBPB,  NFI  and  GATA2,  were 
identified  here  and/or  in  other  studies  [3]. 

FoxAl  and  C/EBPa  have  both  been  implicated  previously  in 
estrogen  receptor  (ER)-mediated  [2]  and  FoxAl  in  AR-mediated 
[3]  gene  expression.  The  Matusik  lab  further  showed  that  FoxA 
proteins  were  expressed  in  the  prostate  and  interacted  with  the  AR, 
thus  modulating  its  activity  on  some  but  not  all  promoters  [39]. 
Although  FoxA2  may  play  a  role  in  prostate  cancer  progression  to 
androgen  independence  [40],  it  is  not  expressed  in  C4-2B  cells  (data 
not  shown).  FoxA  transcription  factors  may  act  as  pioneer  factors, 
due  to  their  winged-helix  structures,  and  may  bind  to  chromatin 
predominantly  at  distant  enhancers  as  an  initial  event  to  which 
subsequent  TF  may  be  recruited.  Their  binding  seems  to  be 
dependent  on  histone  F13K4  methylation  and  they  likely  establish 
lineage-specific  transcription  [41],  Our  data,  on  the  other  hand, 
have  shown  that  in  addition  to  FoxAl  being  attracted  to  certain 
ARORs  in  the  absence  of  hormone,  at  other  sites  FoxAl 
recruitment  was  increased  after  DFIT  treatment  indicating  that 
FoxAl  in  PCa  cells  has  additional  AR-dependent  functions. 

C/EBPs  (CCAAT-enhancer-binding  proteins)  belong  to  a 
family  of  transcription  factors  that  interact  with  the  CCAAT 
box  motif,  which  is  present  in  the  promoters  of  several  genes.  In 
LNCaP  cells,  C/EBPa  over-expression  inhibited  the  expression  of 
PSA  [42],  indicating  that  it  is  a  possible  AR  negative  regulator. 
However,  C/EBPfS  seems  more  relevant  in  cancer  cell  control  in  a 
positive  way,  since  it  is  hormonally  controlled  and  involved  in  cell 
proliferation,  whereas  C/EBPa  is  predominantly  expressed  in  the 
terminal  differentiation  phases  of  epithelial  cells.  For  example,  C/ 
EBP(3  appears  to  be  a  key  transcription  factor  in  cell  growth,  since 
its  depletion  by  RNA  interference  blocked  the  stimulation  of 
growth  hormone  [43].  In  our  study  C/EBPP  (and  not  C/EBPa) 
was  found  at  many  ARORs. 

Nuclear  factor  I  (NFI)  belongs  to  a  family  of  transcription  factors 
that  play  important  roles  in  modulating  transcription  of  differen¬ 
tiation-associated  genes  [44],  Interestingly,  a  two-step  synergistic 
model  was  proposed  to  explain  the  interaction  and  activity  of 
progesterone  receptor  and  NFI  [45].  Although  our  siRNA 
knockdown  experiments  indicated  diverse  effects  of  NFI  on  gene 
regulation,  specific  roles  for  NFI  in  AR  signaling  have  not  yet  been 
proposed,  and  we  believe  our  findings  are  novel  and  may  well  reveal 
an  important  pathway  in  AR  signaling  and  PCa  progression. 

GATA  family  members  are  able  to  bind  to  DNA  at  their 
response  elements  and  mediate  the  ‘opening’  of  compacted 
chromatin  [46]  and  have  been  proposed  to  play  major  roles  in 
endocrine  function  and  disease  [47].  Furthermore,  GATA2  was 
previously  found  as  an  AR  coregulator  at  AR  sites  on  chromosome 
21  and  22  [3],  However,  in  our  study  GATA2  was  found  only  at 


three  AROR  sites.  Interestingly,  all  three  were  in  the  proximity  of 
genes  that  were  substantially  stimulated  by  DHT  treatment  (PSA, 
TGM2  and  PYGB),  indicating  that  this  transcriptional  coregulator 
might  potentiate  the  AR-mediated  transcription  of  these  genes. 
However,  depending  on  the  specific  enhancer,  GATA2  might 
have  enhancing  or  repressing  activities  on  gene  expression,  as  was 
shown  in  the  siRNA  knockdown  experiments. 

Oct- 1  was  proposed  as  an  AR  collaborator  [3]  and  was  shown 
to  physically  interact  with  the  AR  in  a  DNA-dependent  manner 
that  facilitated  cofactor  recruitment  [48,49].  However,  we  found 
only  very  low  levels  of  Oct-1  at  some  ARORs,  indicating  that  the 
participation  of  Oct-1  is  limited  to  perhaps  only  very  few  AR 
regulated  regions  where  it  may  play  a  specialized  function. 

We  have  previously  reported  that  at  some  loci  (but  not  at  others) 
gene-wide  histone  H3  acetylation  leads  to  efficient  AR-mediated 
transcription  in  C4-2B  but  not  in  LNCaP  cells  [24],  Apart  from 
the  different  kallikrein  loci  (Figure  IB),  we  did  not  find  any  other 
gene-wide  acetylation  profiles  on  chromosomes  19  &  20, 
indicating  that  the  expression  of  kallikreins  is  perhaps  examples 
of  only  a  few  genes  marked  in  this  way.  From  the  work  presented 
here  it  is  clear  that  coregulators  also  contribute  significantly  to 
transcriptional  efficiency. 

The  inteiplay  between  the  AR  and  potential  coregulators  is  very 
likely  promoter/enhancer  specific  and  dependent  on  the  pheno¬ 
types  of  the  target  genes  as  was  shown  in  the  present  study.  Some  of 
the  AR  coregulators  may  be  considered  pioneer  factors,  laying  the 
groundwork  to  allow  efficient  AR-mediated  gene  modulation  as  was 
proposed  for  ER  activity  [2].  In  such  cases  they  may  even  provide 
the  initial  site  for  the  AR  to  dock  at  enhancers  or  promoters  that  do 
not  contain  strong  AREs;  in  this  scenario,  the  AR  is  brought  into 
play  via  its  interaction  with  the  coregulator,  with  weak  binding  to,  or 
even  no  contact  with  DNA.  A  complex  site-specific  matrix  of 
transcription  factors  may  therefore  exist  to  allow  specific  gene 
regulation  under  specific  physiological  conditions.  At  this  stage  we 
do  not  know  the  rules  governing  AR-mediated  expression  control  at 
different  loci.  As  a  matter  of  fact,  there  are  likely  dozens  more 
coregulators  that  determine  AR  activity  across  all  ARORs  and  cell 
types.  Deciphering  the  rules  of  AR  engagement  by  these 
coregulators  across  various  ARORs  and  under  different  physiolog¬ 
ical  and  pathological  conditions  may  require  decades  of  research. 
Meanwhile,  measuring  epigenetic  marks,  such  as  histone  H3 
acetylation,  provides  a  powerful  approach  as  a  functional  adjunct 
to  characterize  ARORs.  The  combined  mapping  of  TF-binding 
regions  and  histone  modifications  may  therefore  prove  a  beneficial 
principle  to  be  used  for  the  functional,  genome-wide  characteriza¬ 
tion  of  not  only  AR,  but  also  that  of  many  other  TFs. 

Conclusions 

Over  the  past  several  decades  a  general  concept  of  TF-mediated 
gene  regulation  has  emerged.  It  depicts  a  sequence  of  events, 
starting  with  TFs  binding  at  specific  regions  of  DNA,  followed  by 
the  covalent  modification  of  chromatin  proteins  (primarily 
histones)  to  ‘open’  chromatin,  and  ultimately  allowing  the  nearby 
docking  of  the  transcription  machinery.  This  is  not  necessarily 
always  the  case  and  alternative  insights  were  recently  revealed  by 
the  use  of  new  high  throughput  techniques,  which  allow  the 
agnostic  mapping  of  such  regions  and  an  unbiased  appreciation  of 
their  functions.  In  the  current  study,  many  AR  binding  sites  were 
not  associated  with  nearby  gene  expression  despite  the  fact  that 
they  had  intrinsic  positive  regulatory  activity  in  chromatin- 
independent  assays.  On  the  other  hand,  those  that  regulated  the 
expression  of  nearby  genes  were  1 0-times  more  likely  marked  by 
acetylation,  indicating  that  AR  occupancy  and  associated  histone 
acetylation  are  both  necessary  requirements  for  positive  regulatory 
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competence.  The  existence  of  silent  AR  occupied  regions,  however 
indicates  that  AR  occupancy  does  not  necessarily  lead  to  the 
recruitment  of  histone  acetylases  and  subsequent  nearby  gene 
expression.  Therefore,  the  rate-limiting  step  for  AR-mediated  gene 
expression  is  not  occupancy  per  se.  Silent  occupied  regions  may  act 
as  a  reservoir  of  bound  AR,  possibly  poised  to  mediate  gene 
expression  as  conditions  change.  Such  conditions  may  include 
chromatin  modifications  and/or  the  recruitment  of  coregulators, 
since  most  of  the  AR  occupied  regions  were  also  co-occupied  by 
other  TFs,  which  in  several  cases  acted  as  coregulators  of  gene 
expression.  We  conclude  that  the  diversity  of  site-specific  functions 
of  the  AR  points  to  differential  regulation  of  gene  expression  by 
the  same  transcription  factor  related  to  chromatin  structure  and 
the  presence  of  coregulators. 

Materials  and  Methods 

Cell  Culture  and  Materials 

Human  prostate  cancer  C4-2B  cells,  obtained  from  ViroMed 
Laboratories  (Minneapolis,  MN),  were  maintained  in  RPMI  1640 
supplemented  with  5%  (v/v)  fetal  bovine  serum  (FBS).  Antibodies 
were  anti-AR  (N20),  anti-HNF-3a  (H-120),  anti-NFI  (H-300),  anti- 
C/EBP|3  (C19),  anti-GATA2  (HI  16),  normal  rabbit  IgG  (Santa 
Cruz  Biotechnology,  Santa  Cruz,  CA),  anti-Oct  1  (Abeam  Inc., 
Cambridge,  MA)  and  anti-AcH3-K9/K14  (Upstate  Biotechnology 
Inc.,  Lake  Placid,  NY).  Pre-designed  SMARTpool  siRNA  reagents 
against  FoxAl,  NFI,  C/EBPfl,  GATA2  and  nonspecific  siRNA 
were  purchased  from  Dharmacon  (Lafayette,  CO).  AR  siRNA  was 
described  previously  [24],  TaqMan  qPCR  probes  were  obtained 
from  Biosearch  Technologies  (Novata,  CA). 

ChIP-chip 

C4-2B  cells  were  cultured  in  phenol  red-free  RPMI  1640 
supplemented  with  5%  charcoal/dextrane-stripped  FBS  (CSS)  for 
3  days.  After  4  h  DHT  (10  nM)  treatment,  ChIP  was  conducted  as 
described  previously  [23]  except  that  no  salmon  sperm  DNA  was 
used  as  blocking  reagent.  The  immunoprecipitated  DNA  and  un¬ 
enriched  input  DNA  were  treated  with  RNase  A  and  purified  using 
the  QIAquick  PCR  purification  kit  (Qiagen,  Valencia,  CA).  The 
purified  DNA  was  blunt-ended  using  T 4  polymerase  (New  England 
BioLabs,  Ipswich,  MA),  ligated  to  the  linkers  (oJW102,  5'- 
GCGGTGACCCGGGAGATCTGAATTC-3',  and  oJW103,  5'- 
GAATTCAGATC-3'),  and  amplified  by  ligation-mediated  PCR 
(LM-PCR).  NimbleGen  Systems  (Madison,  WI)  labeled  amplified 
input  DNA  and  ChIP  DNA  with  Cy3  and  Cy5,  respectively,  and 
hybridized  to  genomic  tilling  arrays  HG17Tilling  Set  35. 

Peak  calling 

Normalized  log-ratios  for  each  replicate  were  obtained  from 
NimbleGen  and  processed  independently.  Moving  averages  Mt 
were  determined  for  600-bp  windows  centered  on  each  probe  i,  for 
all  windows  containing  three  or  more  probes.  Statistically 
significant  windows  were  determined  by  comparing  actual  moving 
average  values  to  a  null  distribution  obtained  by  permutation 
sampling  as  follows.  For  each  actual  moving  average  M,  containing 
j  probes,  we  computed  a  null  moving  average  M ,  by  randomly 
sampling  j  log-ratio  values  from  the  entire  dataset.  This  was 
performed  10,000  times  to  generate  a  distribution  of  permuted 
moving  averages.  A  normal  QQ;plot  (Figure  S5)  indicated  that 
this  distribution  was  approximately  normal  except  for  outliers  in 
the  right  tail,  so  we  estimated  mean  (u)  and  variance  (62) 
parameters  and  performed  a  one-sided  normal  test  to  calculate 
individual  window  p-values  based  on  this  null  distribution.  To 
accommodate  the  large  number  of  statistical  tests,  we  used  the 


method  of  Benjamini  et.  al.  to  control  the  False  Discovery  Rate 
(FDR)  at  an  FDR  (a)  level  of  0.005.  Regions  with  at  least  four 
consecutive  probes  were  considered  a  peak,  then  peaks  were 
extended  until  the  first  insignificant  probe  was  encountered,  and 
overlapping  peaks  were  merged.  For  AcH3  occupied  regions,  final 
peaks  were  those  that  were  called  in  both  of  the  replicate 
experiments.  For  AR  occupied  regions  (ARORs),  LI  peaks  were 
defined  as  those  present  in  all  diree  replicate  experiments,  while 
L2  peaks  were  those  present  in  two  of  three. 

Genomic  properties  of  AcH3  peaks  and  ARORs  (Fig  ID) 

Distance  to  TSSs  and  genomic  contexts  were  determined  relative 
to  transcript  annotations  in  the  “Known  Genes”  track  of  the  UCSC 
Genome  Browser.  6,200  randomized  ARORs  were  generated  by 
picking  random  regions  from  chromosomes  19  and  20  with 
properties  matched  to  the  actual  LI  ARORs  -  they  had  the  same 
relative  frequencies  on  chromosome  19  vs.  20,  were  matched  for 
size,  and  were  required  to  have  no  more  than  one  missing  probe. 
95%  confidence  intervals  are  defined  as  the  TSS  distances  of  the 
upper  2.5%  and  the  lower  2.5%  of  the  randomized  ARORs. 

Conventional  ChIP  Assay 

ChIP  analyses  on  LNCaP  or  C4-2B  cells  were  performed  as 
described  previously  [23].  DNA  samples  from  ChIP  preparations 
were  analyzed  by  qPCR  using  TaqMan  PCR  Master  Mix 
(Applied  Biosystems,  Branchburg,  NJ).  The  primers  and  probes 
are  listed  in  Table  S4. 

Expression  Microarray 

Hormone-depleted  C4-2B  cells  were  treated  with  DHT  (10  nM) 
or  an  equal  volume  of  ethanol  for  16  hr.  Total  RNA  from  three 
biological  replicates  was  extracted  using  Aurum  Total  RNA  Kit 
(Bio-Rad,  Hercules,  CA)  and  hybridized  to  Human-6  v2  Expression 
BeadChIP  (Ulumina,  San  Diego,  CA)  at  the  USC/Norris  Cancer 
Center  Core  Facility.  To  determine  differentially  expressed  genes, 
we  performed  a  two-tailed  t-test  (equal  variance)  across  three 
DHT—  replicates  and  three  DHT+  replicates  for  all  46,713  array 
features  having  a  valid  measurement  for  all  6  replicates.  To 
determine  an  empirical  null  distribution,  we  generated  1,000 
permutations  of  the  data  matrix  by  randomly  shuffling  the  three 
DHT—  and  three  DHT+  assignments  for  each  array  feature 
(yielding  roughly  47  million  randomized  features).  An  adjusted  p- 
value  was  then  determined  for  each  feature.  These  p-values  were 
converted  to  Expectation  values  (E-values)  by  multiplying  by  the 
number  of  features  (Table  S2).  A  significance  cutoff  of  E<  =  150 
was  chosen  because  it  minimized  the  FDR  (at  about  0.1)  and 
included  all  well-studied  AR  targets  in  prostate  cancer  cells.  For 
AROR  comparisons,  we  only  considered  the  1,232  RefSeq 
transcripts  covered  by  the  chromosome  19  &  20  regions  on  the 
NimbleGen  tiling  array,  and  set  a  significance  cutoff  of  E<  =  5, 
corresponding  to  the  E-value  chosen  for  the  full  genome,  and  also 
occurring  at  the  minimum  achievable  False  Discovery  Rate  (FDR) 
of  0. 1 .  This  cutoff  also  included  all  features  with  fold  changes  of  1 .5 
or  greater,  and  included  all  well-studied  AR  targets  in  prostate  cells. 
An  AROR  was  considered  “adjacent”  to  a  RefSeq  feature  if  it  was 
within  the  annotated  gene  or  one  of  its  introns,  or  if  it  was  upstream 
or  downstream  of  the  gene  with  no  other  intervening  genes. 

Inter-AROR  distances  (Figure  3C  &  D) 

Distances  were  calculated  between  each  pair  of  AcH3— ARORs 
(3C)  and  each  pair  of  AcH3+ ARORs  (3D).  Chromosome  and  size- 
matched  randomized  ARORs  were  generated  as  described  above. 
For  each  trial,  we  generated  sets  with  the  same  number  of  ARORs 
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as  the  experimentally  determined  set.  For  95%  confidence 
interval,  we  took  the  upper  2.5%  and  the  lower  2.5%  of  distances 
for  10,000  independent  trials. 

Construction  of  Plasmids 

The  AROR  sequences  (an  average  of  500  bp  fragment 
surrounding  the  AROR  peak  center)  were  PCR  amplified  from 
C4-2B  genomic  DNA  and  subcloned  upstream  of  a  thymidine 
kinase  (TK)  minimal  promoter-luciferase  vector  [from  Dr  Axel 
Schonthal  (USC)]  in  both  directions.  The  primers  for  subcloning 
are  listed  in  Table  S5. 

Luciferase  Assay  and  DHT  responsiveness 

C4-2B  (1  xlO5  cells/well)  were  plated  in  12-well  plates  and 
grown  in  phenol  red-free  RPMI  1640  containing  5%  CSS  for  2 
days.  Cells  were  then  transfected  with  AROR  containing  TK- 
luciferase  reporter  plasmids  using  Lipofectamine  LTX  Reagent 
(Invitrogen  Corp.,  Carlsbad,  CA)  according  to  the  manufacturer’s 
protocol.  After  transfection,  cells  were  treated  with  DHT  (10  nM) 
or  ethanol  vehicle  for  24  h.  Luciferase  activities  were  measured  as 
previously  described  [23].  For  each  construct,  two  independent 
transformants  were  measured  in  the  DHT—  condition  and  two  in 
the  DHT+  condition.  Log  intensities  were  used  to  perform  a  one 
sided  t-test  (assuming  equal  variance)  to  determine  a  p-value.  We 
used  log  intensities  because  inter-sample  variance  was  correlated 
with  raw  intensity  scores  but  not  with  log  intensity  scores.  The 
same  procedure  was  performed  for  each  construct  in  the  reverse 
orientation,  and  the  minimum  of  the  forward  and  reverse  p-values 
was  recorded.  In  some  cases,  this  procedure  was  repeated  several 
times  for  the  same  construct,  in  which  case  we  took  the  median  p- 
value.  For  Figure  4A,  this  p-value  is  plotted  against  the  intensity 
fold  changes,  which  were  averaged  using  the  geometric  mean. 

Motif  searches 

BioProspector,  MDScan,  and  Weeder  were  used  to  identify 
motifs  de  novo  by  comparing  the  60  LI  ARORs  (we  excluded  A002 
which  has  14  tandem  AREs,  and  A059  which  could  not  be  cloned) 
to  6,200  size-matched  controls.  Each  was  run  to  search  for  motifs 
of  various  lengths  (6-22  for  MDScan  and  BioProspector,  6—12  for 
Weeder).  The  “-T  20”  command  line  parameter  was  used  for 
BioProspector,  and  “-t  30”  was  used  for  MDScan.  Because  these 
de  novo  search  programs  produce  a  large  number  of  highly  similar 
versions  of  the  same  motif  (408  for  MDscan  and  BioProspector, 
1 25  for  Weeder),  we  performed  a  clustering  technique  to  collapse 
them  to  a  relatively  non-redundant  set.  We  examined  each 
possible  pair  of  motifs  A  and  B,  and  measured  the  Jaccard  distance 
between  the  two  motifs,  which  is  defined  as  1  —  where 

count(A&B)  is  the  number  of  overlapping  binding  sites  between  A 
and  B,  and  count(A\B)  is  the  total  number  of  binding  sites  predicted 
for  either  motif.  Motifs  were  then  clustered  using  linear 
hierarchical  clustering  (average  linkage),  and  motifs  with  an 
average  Jaccard  distance  of  0. 1  or  less  were  grouped  together.  The 
motif  with  the  lowest  average  distance  to  all  other  motifs  in  the 
cluster  was  recorded  as  the  prototype  and  used  for  all  subsequent 
analysis.  Weeder  produced  23  clusters,  and  BioProspector  and 
MDScan  results  were  combined  to  yield  22.  To  this  we  added  2 
versions  of  the  ARE  motif  from  JASPAR/ ConSite,  and  5  motifs 
identified  by  CEAS  (ARE,  HNF3a,  NFI,  C/EBP,  and  GR)  [28]  to 
yield  a  total  of  52  enriched  motifs  (Table  S3). 

Motif  enrichment  in  DHT-responsive  ARORs  (Figure  4B) 

For  each  of  the  constructs  with  luciferase  data,  we  counted  the 
number  of  predicted  binding  sites  within  the  construct  for  each  of 


the  52  motifs  enriched  in  ARORs  (4B,  upper),  and  separately 
counted  the  number  of  predicted  sites  for  each  of  the  1 ,326  unique 
motif  pairs  (4B,  lower).  For  each  motif  or  motif  pair,  we  generated 
an  enrichment  significance  score  by  performing  a  one-sided  non- 
parametric  t-test  (assuming  equal  variance)  between  the  binding 
site  counts  in  the  18  strongly  induced  ARORs  vs.  the  19  non- 
responsive  ARORs.  Adjusted  p-values  were  determined  by 
randomly  permuting  the  non-responsive  and  responsive  labels 
1 ,000  times  for  each  motif,  and  determining  an  empirical 
distribution  of  t-test  scores,  which  was  used  to  calibrate  the  actual 
t-test  scores. 

siRNA  Transfection 

C4-2B  (1  x  10’ cells /well)  were  plated  in  6-well  plates  and 
grown  in  phenol  red-free  RPMI  1640  containing  5%  CSS  for  2 
days.  Cells  were  transfected  with  the  siRNA  duplexes  as  indicated 
at  a  final  concentration  of  1 00  nM  using  Oligofectamine  Reagent 
(Invitrogen)  according  to  die  manufacturer’s  instructions.  After 
transfection,  cells  were  grown  in  phenol  red-free  RPMI  1640 
containing  5%  CSS  for  48  hr  and  then  treated  with  DHT  (10  nM) 
or  ethanol  vehicle  for  additional  18  hr.  Total  RNA  extraction  and 
protein  extraction  were  conducted  respectively  for  further 
assessment. 

qRT-PCR 

After  the  indicated  treatments,  total  RNA  from  C4-2B  cells  was 
extracted  using  Auruni  Total  RNA  Kit  (Bio-Rad,  Hercules,  CA). 
cDNA  was  prepared  using  iScript  cDNA  Synthesis  Kit  (Bio-Rad), 
and  qPCR  was  conducted  using  TaqMan  PCR  Master  Mix  or 
SYBR  Green  PCR  Master  Mix  (Applied  Biosystems,  Branchburg, 
NJ).  The  primers  and  probes  are  listed  in  Table  S6.  Triplicate 
PCR  reactions  were  conducted.  GAPDH  mRNA  expression  was 
analyzed  for  each  sample  in  parallel. 

Immunoblotting 

Immunoblotting  were  performed  as  previously  described  using 
the  indicated  antibodies  [50,51], 

Supporting  Information 

Figure  SI  AROR  distribution  in  Chromosomes  21  &  22  of 
LNCaP  cells.  Data  from  Wang  et  al  [3]. 

Found  at:  doi:  10.1 37 1/journal.pone. 0003645. sOOl  (0.09  MB  TIF) 

Figure  S2  Conservation  of  AR  GhIP  and  AcH3  ChIP  non¬ 
coding  regions.  Cumulative  distribution  plots  of  28-way  phastCons 
conservation  scores  (UCSC  phastCons28way)  are  plotted  for  the 
non-exonic  ARORs,  Estrogen  Receptor  Occupied  Regions 
(ERORs)  from  [2],  and  known  enhancer  elements  taken  from 
[52].  “Random”  ARORs  are  the  non-exonic  subset  of  the  size- 
matched  randomized  chromosome  1 9  &  20  regions  described  in 
Figure  ID.  ERORs  and  ARORs  appear  to  be  only  slightly  more 
conserved  than  the  non-coding  genomic  background. 

Found  at:  doi:  10. 137 1/journal.pone. 0003645. s002  (7.99  MB  TIF) 

Figure  S3  Hormone-dependent  gene  expression  in  C4-2B  cells. 
Illumina  expression  arrays  were  used  to  measure  expression  levels 
of  46,713  transcripts  in  three  replicates  before  and  diree  replicates 
after  DHT  exposure  in  C4-2B  cells.  The  student’s  t-test  was  used 
to  determine  statistical  significance,  and  p-values  were  adjusted 
based  on  random  permutations  of  the  full  dataset.  This  volcano 
plot  shows  the  E-value  (number  of  transcripts  at  the  given  p-value 
expected  by  chance)  plotted  against  the  mean  fold  change.  552 
transcripts  up-regulated  at  die  E  =  150  level  (permutation-adjusted 
p-value  =  0.004),  along  with  416  transcripts  down-regulated 
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(permutation-adjusted  p-value  =  0.003),  are  shown  in  the  upper 
two  quadrants,  and  of  the  well-studied  DHT-responsive  genes  in 
prostate  cancer  are  labeled. 

Found  at:  doi:  10.1 37 1 /journal.pone. 0003645. s003  (2.42  MB  TIF) 

Figure  S4  Predicted  ARE  and  coregulator  binding  sites  in  LI 
ARORs.  Binding  sites  for  the  ARE  and  3  coregulator  motifs  are 
shown  for  the  AROR  constructs  either  highly  induced  (upper  left), 
weakly  induced  (lower  left),  unchanged  (upper  right),  and 
repressed  (lower  right)  in  our  luciferase  reporter  activity  assays. 
The  strongly  induced  ARORs  had  a  total  of  70  sites  in  18  ARORs 
(3.9  sites  per  AROR),  while  the  weakly  induced  had  35  sites  in  2 1 
ARORs  (1.7  sites  per  AROR)  and  the  unchanged  had  20  sites  in 
19  ARORs  (1.6  sites  per  AROR). 

Found  at:  doi:  10. 1371  /journal.pone. 0003645. s004  (40.51  MB 
TIF) 

Figure  S5  Normality  of  permuted  tiling  array  moving  averages. 
Individual  log-ratios  from  one  of  the  AR-ChIP  NimbleGen  arrays 
were  permuted  into  “randomized”  moving  averages  (the  number 
of  probes  per  window  were  matched  to  actual  600-bp  windows  on 
the  array).  10,000  randomized  moving  averages  were  sampled  and 
plotted  against  the  standard  normal.  The  results  show  that  they 
largely  follow  a  normal  distribution.  The  deviating  right  tail  comes 
front  enriched  probes,  but  represent  only  a  small  fraction  of  all 
values  and  do  not  effect  the  statistical  testing  described  above 
which  aims  to  reject  against  the  background  distribution,  i.e.  the 
bulk  of  moving  average  scores. 

Found  at:  doi:  10. 1371  /journal.pone. 0003645. s005  (0.88  MB  TIF) 

Figure  S6  Occupancy  of  AR  and  coregulators  in  LNCaP  and 
C4-2B  cells.  LNCaP  and  G4-2B  cells  were  cultured  in  hormone- 
depleted  medium  for  3  days  and  then  treated  with  1 0  nM  DHT  or 
ethanol  vehicle  for  4  h.  Conventional  site-specific  ChIP  assays 
were  performed  with  indicated  antibodies.  The  values  are 
presented  as  percentage  of  input. 

Found  at:  doi:  10. 1371  /journal.pone. 0003645. s006  (3.54  MB  TIF) 

Figure  S7  Occupancy  of  AR  and  coregulators  at  PSA  and  TGM 
loci.  C4-2B  cells  were  cultured  in  hormone-depleted  medium  for  3 
days  and  then  treated  with  10  nM  DHT  or  ethanol  vehicle  for  4  h. 
Conventional  site-specific  ChIP  assays  were  performed  with 
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